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ABSTRACT 

A  brief  history  of  the  development  of  tables  of  random  numbers  is 
presented.  This  is  followed  by  an  investigation  of  the  generation  of 
uniformly  distributed  random  numbers  on  the  CDC  160li  digital  computer. 
Certain  congruential  relations  are  examined  in  order  to  determine 
parameters  that  will  produce  acceptable  random  number  sequences.  Sta¬ 
tistical  tests  of  the  generated  numbers  are  described  and  the  results 
of  these  tests  are  presented. 
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PREFACE 


Statistical  sampling  techniques  using  a  "Monte  Carlo"  method  can 
require  a  large  supply  of  random  numbers.  Such  techniques  have  been  used 
in  the  evaluation  of  definite  integrals,  the  solution  of  ordinary  and 
partial  differential  equations,  the  solution  of  integral  equations  and 
matrix  inversion,  to  mention  only  a  few  applications.  On  the  other  hand, 
some  problems,  described  in  terms  of  a  stochastic  model,  can  only  be 
solved  by  recourse  to  sampling  techniques,  as  for  example,  mathematical 
war  gaming  problems . 

When  such  models  are  programmed  for  a  high  speed  digital  computer, 
it  is  necessary  to  produce  large  numbers  of  random  numbers  quickly.  The 
tables  of  random  digits  or  random  uniform  numbers  customarily  used  in 
hand  calculations  do  not  lend  themselves  to  use  in  the  high  speed  compu¬ 
ter.  It  is  inefficient  to  read  into  the  computer  the  large  number  of 
random  digits  required  to  meet  the  demand  for  such  numbers  arising  in 
problems  which  are  otherwise  within  the  capabilities  of  the  machine. 

Using  the  same  numbers  repeatedly  in  a  given  problem  would  introduce 
bias. 

It  has  been  suggested  that  special  equipment  which  utilizes  a  phys¬ 
ical  random  process  be  used  to  produce  the  random  numbers.  On  the  sur¬ 
face,  this  would  seem  desirable  but,  in  practice,  difficulties  immedi¬ 
ately  arise.  First,  the  construction  and  maintenance  of  such  a  device 
would  be  quite  expensive.  To  be  useful,  it  would  be  required  to  produce 
a  number  in  the  order  of  a  few  microseconds  for  extended  periods.  Reli¬ 
ability  requirements  would  necessitate  its  output  to  be  continually 
checked  for  randomness.  The  second  difficulty  is  that  such  a  random 
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device  would  not  permit  the  recalculation  of  a  problem  using  the  same 
generated  numbers.  This  could  prove  to  be  a  handicap  in  checking  on 
machine  operations. 

These  difficulties  lead  to  the  exploration  of  arithmetical  schemes 
for  generating  numbers  which  take  advantage  of  the  computer's  high  speed 
and  place  minimum  demands  on  computer  memory  storage.  Such  numbers 
could  not  be  considered  truly  random;  hence,  the  adoption  of  the  term 
"pseudo-random  numbers" . 

It  is  the  intent  of  this  paper  to  present  a  brief  historical  devel¬ 
opment  of  random  number  generation  and  to  explore,  in  particular,  the 
multiplicative  congruential  type  of  generator  for  adaptation  to  the 
CDC  I60U. 
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I .  INTRODUCTION 


1.  Kendall  and  Smith's  Random  Sampling  Numbers. 

It  might  be  supposed  that  if  one  wanted  to  obtain  a  sequence  of 
random  digits  that  almost  any  method  that  employed  some  sort  of  chaotic 
selection  would  suffice.  Observing  the  indications  of  a  roulette  wheel 
or,  perhaps,  an  unsystematic  selection  of  digits  from  a  large  telephone 
directory  might  seem  like  "reasonable"  methods.  Once  the  digits  were 
collected,  however,  a  problem  would  arise.  By  what  criterion  could 
their  randomness  be  judged? 

It  was  to  this  problem  that  M.  G.  Kendall  and  B.  Babington  Smith 

addressed  themselves  in  their  now  classic  work,  "Randomness  and  Random 

r  i  i 

Sampling  Numbers".  jlj  They  start  their  paper  with  a  discussion  on 

the  relation  between  randomness  and  probability. 

In  colloquial  speech  the  word  "random’1  is  applied  to  any  method 
of  choice  which  lacks  aim  or  purpose;  and  this  usage  is  also  found 
in  certain  sciences.  In  statistics,  however,  the  word  has  a  some¬ 
what  different  and  more  definite  significance,  closely  related  to 
probability.  It  appears,  in  fact,  that  for  statistical  purposes 
the  ideas  of  randomness  and  probability  are  inseparable,  whether 
one  belongs  to  the  "intuitive"  school  which  regards  probability  as 
an  undefinable,  or  to  the  opposing  "frequency"  school  which  seeks 
to  explain  it  in  terms  of  statistical  frequencies. 

Having  linked  the  concepts  of  probability  and  randomness,  Kendall 

and  Smith  concede  that  there  are  many  controversial  areas  that  pose 

abstract  problems  that  are, —  "verging,  at  times,  on  the  theological." 

In  the  interest  of  getting  on  with  the  more  mundane  problems  of  Random 

Sampling  Numbers,  they  state: 

We  take  randomness  and  probability  to  be  undefined  ideas  obeying 
certain  intuitively  formulated  principles,  which  will  be  found  suf¬ 
ficient  to  give  results  of  practical  application. 


^Numbers  in  brackets  refer  to  references  cited  in  the  Bibliography  at 
the  end  of  the  paper. 
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The  authors  note  that  any  combination  of  digits  could  be  considered 
a  random  set,  in  that  all  combinations  are  possible  when  drawing  from  an 
infinite  universe  of  digits.  However,  not  all  combinations  are  practical 
for  random  sampling;  for  example,  a  block  of  digits  which  are  all  the 
same.  For  purposes  of  sampling,  a  definition  of  Random  Sampling  Number 
i§  made:  a  set  of  numbers  which  can  be  used  for  random  sampling,  not 
necessarily  a  set  obtained  by  random  methods.  It  is  Kendall  and  Smith's 
intention  to  produce  a  table  of  random  digits  and  to  test  these  for 
acceptability  as  Random  Sampling  Numbers.  That  is,  Random  Sampling  Num¬ 
bers  is  a  collective  term  for  a  series  of  random  digits. 

Such  a  set  of  Random  Sampling  Numbers  should  conform  to  the  notion 
of  relative  frequency.  That  is,  if  samples  were  taken  from  a  universe 
consisting  of  an  infinite  number  of  digits  such  that  each  of  the  ten 
digits  0  through  9  may  be  expected  to  appear  with  the  same  frequency, 
then,  it  is  expected  that  in  a  large  number  of  trials  each  of  the  digits 
will  appear  approximately  an  equal  number  of  times.  Further,  it  would 
be  expected  that  each  pair  of  digits  would  occur  approximately  the  same 
number  of  times,  similarly  triples,  etc.;  that  is,  no  digit  would  tend 
to  follow  another  digit  in  any  consistent  pattern. 

Here  then  is  an  important  distinction:  any  given  set  of  N  numbers 
need  not  follow  such  expectations  but  a  set  of  Random  Sampling  Numbers 
should  do  so.  Hereafter,  we  shall  identify  such  a  set  of  Random  Samp¬ 
ling  Numbers  as  a  random  set. 

In  order  to  ascertain  whether  a  given  set  of  digits  is  a  good  enough 
"approximation"  to  a  random  set,  the  observed  frequencies  may  be  compared 
to  the  theoretical  frequencies  by  means  of  a  chi-square  test.  On  "purely 
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arbitrary  grounds" ,  Kendall  and  Smith  elected  to  say  that  a  set  of  num¬ 
bers  is  acceptable  if  the  resulting  chi-square  statistic  is  within  the 
range  of  0.01  —  P  —  0.99,  where,  with  0j  the  observed  and  Ej  the 
expected  frequencies  of  the  jth  digit, 


and 


x2  =  I 

j  =  O 


(0.i  -  V 


P 


o 


dX 


9-1 

X  dX 


with  9  being  the  number  of  degrees  of  freedom. 

Having  thus  determined  the  characteristics  of  this  set,  the  authors 
then  proceed  to  establish  a  series  of  four  tests  for  a  set  of  digits  to 
be  considered  as  a  random  set.  Subsequently,  these  tests  have  become 
standard  in  the  literature  on  random  numbers.  Briefly,  they  are  as  fol¬ 
lows: 

(a)  The  frequency  test  to  determine  if  all  the  digits  of  a  set 
occur  in  approximately  equal  amounts. 

(b)  The  serial  test  which  tests  to  see  if  any  digit  tends  to 
follow  another. 

(c)  The  poker  test  which  is  named  after  the  card  game  and  examines 
the  digits  in  blocks  of  five  for  five  of  a  kind,  four  of  a  kind 
etc.  The  frequency  of  these  hands  are  then  compared  with  the 
expected  distribution  of  such  hands. 
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(d)  The  gap  test  which  inquires  into  the  gaps  occurring  between  the 


same  digits  in  a  series  and  compares  this  with  the  theoretical 
distribution. 

Although  these  tests  are  not  sufficient,  the  authors  feel  that  collec¬ 
tively  they  are  powerful. 

With  the  foregoing  criteria  established,  Kendall  and  Smith  attempt 
to  set  up  a  random  series  by  selecting  digits  from  the  London  Telephone 
Directory  in  a  haphazard  fashion.  Sven  though  they  took  what  would  seem 
like  extra  precautions  to  avoid  bias,  the  results  of  their  tests  indi¬ 
cated  significant  bias. 

They  then  constructed  an  electro-mechanical  device  to  produce  random 
digits.  Essentially,  it  consisted  of  a  wheel  divided  into  ten  equal  seg¬ 
ments  with  a  digit  assigned  to  each  segment.  It  was  driven  by  an  elec¬ 
tric  motor  whose  speed  was  accurately  controlled.  The  operator  on  a 
separate  circuit  controlled  a  neon  lamp  whose  flash  was  of  such  a  short 
duration  that  the  moving  wheel  appeared  to  be  stationary.  The  number 
that  appeared  under  a  previously  fixed  pointer  was  recorded  as  the  ran¬ 
dom  digit.  The  actuation  of  the  lamp  was  kept  haphazard  by  having  the 
operator  move  a  stylus  over  an  intricate  maze  of  circuitry  which,  it  was 
maintained,  caused  the  light  to  operate  in  a  random  fashion. 

A  table  of  100,000  digits  was  produced  in  this  fashion  and,  based 
on  their  tests,  were  considered  acceptable  as  Random  Sampling  Numbers  £lj. 

2.  RAND  Corporation's  Table  of  Random  Digits. 

Kendall  and  Smith's  table  appears  to  have  been  satisfactory  for  a 
number  of  years;  at  least,  there  were  no  other  tables  published.  In 
19^7,  a  group  of  research  workers  in  the  RAND  Corporation  responded  to  a 
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growing  need  for  random  numbers  by  producing  a  table  of  one  million  ran¬ 
dom  digits  2^  .  These  were  needed  to  solve  various  computerized 
research  problems  by  experimental  probability  procedures .  Such  proce¬ 
dures  were  called  by  the  colorful  name,  Monte  Carlo  Methods,  For  many 
of  the  problems  that  RAND  was  working  on,  the  need  for  numbers  far 
exceeded  those  of  Kendall  and  Smith's  Tables.  In  fact,  it  meant  that 
these  tables  would  have  to  be  used  time  and  again  in  a  particular  prob¬ 
lem.  This  presented  the  consequent  danger  of  introducing  unwanted  cor¬ 
relations.  Such  a  growing  thirst  for  random  numbers  was,  of  course,  due 
to  new  problems  and  rapidly  developing  computer  technology  available  for 
solving  the  se  problems . 

RAND'S  table  was  produced  by  electronic-mechanical  means  and  was 

apparently  more  sophisticated  than  Kendall  and  Smith's  machine. 

In  principle  the  machine  was  a  32 -place  roulette  wheel  which  made, 
on  the  average,  about  3,000  revolutions  per  trial  and  produced 
one  number  per  second. 

It  appears  that  the  engineers  designing  the  device  did  have  many 
problems  because  the  original  machine  showed  statistically  significant 
biases  and  had  to  be  modified  extensively.  Further,  even  though  elec¬ 
tronic  checks  were  continually  made,  after  a  month  the  results  of  tests 
of  a  block  of  numbers  produced  showed  significant  bias.  This  indicated 
that  the  machine  was  running  down  even  though  the  electronic  checks  were 
acceptable . 

3.  Pseudo-random  Number  Generation. 

While  RAND’S  table  was  suitable  for  work  with  punched  card  compu¬ 
ters,  it  proved  impractical  with  the  advent  of  high  speed  electronic 
computers  because: 


(a)  it  placed  severe  storage  requirements  on  the  machine 

(b)  reading  in  the  tables  was  slow 

(c)  much  larger  tables  were  required  for  the  solution  of  certain 
types  of  problems. 

These  shortcomings  led  to  an  interest  in  methods  for  producing 
"pseudo-random"  numbers  by  arithmetic  means.  4  definition  of  this  tern 
was  given  by  D.  H.  Lehmer  as: 

A  pseudo-random  sequence  is  a  vague  notion  embodying  the  idea  of 
a  sequence  in  which  each  term  is  unpredictable  to  the  uninitiated 
and  whose  digits  pass  a  certain  number  of  tests  traditional  with 
statisticians  and  depending  somewhat  on  the  uses  to  which  the 
sequence  is  to  be  put. 

One  of  the  first  methods  used  was  proposed  by  von  Neumann  and  Metro¬ 
polis  wherein  an  arbitrary  n-digit  number,  Yq  ,  was  selected.  This  num¬ 
ber  was  then  squared  and  was  produced  as  an  n-digit  number  from  the 
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center  of  the  2  n-digits  of  Yq  .  The  process  was  then  repeated,  produc- 
ing  Yi+1  from  Y.^. 


Although  this  process  generated  pseudo-random  numbers,  its  period 


was  not  predictable.  Further,  it  was  demonstrated  by  D.  H.  Lehmer  3 
that,  in  general,  it  would  not  produce  a  very  long  cycle  before  the 
process  degenerated  into  a  sequence  of  zeros.  Consequently,  it  could 
not  be  recommended  as  a  source  of  great  quantities  of  random  numbers. 

During  the  second  symposium  at  the  Harvard  Computation  Laboratory, 


D.  H.  Lehmer  suggested 
tion  (See  Appendix  A) 


that  the  multiplicative  congruential  rela- 


Yr+1  -  L  Yr  (raod  M)  (1-D 

could  be  used  where  the  least  positive  residue  is  taken  as  the  generated 


6 


number.  In  effect,  this  statement  says  that  (  Yr+^  “  YYr  )  /  M  is  an 
integer.  The  modulus  M  could  be  selected  so  as  to  be  compatible  with 
the  base  of  the  machine  being  used;  that  is,  it  could  be  of  the  form 
(2P)  or  (10P) .  P  normally  is  selected  to  be  the  number  of  positions 
available  in  a  word  of  the  computer  being  used.  Further,  in  a  binary 

machine,  with  a  proper  selection  of  L,  the  period  of  this  generator  is 

2(p-2). 

Using  P  in  this  way  avoids  the  necessity  for  performing  the  final 
division  operation  in  most  high  speed  computers.  For  example,  in  the 
CDC  I60U  using  a  modulus  of  2^,  the  multiplication  (  MUI  )  of  the  number 
L  by  Yr  positions  the  product  in  the  QA  registers.  The  U7  low-order 
bits  of  this  product  in  the  A  register  are,  in  fact,  the  desired  resid¬ 
ual  or  the  newly  generated  random  number. 

Although  such  a  method  is  relatively  fast,  nevertheless,  it  does 
use  the  multiplication  command  which,  for  a  computer,  is  time  consuming. 


Greenberger 


proposed  that  L  in  equation  (1-1)  be  modified  to 


=  (  2A  +  3  )  Yr 


(mod  2P) 


(1-2) 


with  A  —  3.  By  doing  this,  the  multiplication  could  be  performed  by  a 

shift  command  and  3  add  commands  which  represented  a  considerable  saving 

in  time  over  the  straight  multiplication  instruction.  Further,  he  shows 

5 i  that  an  L  of  this  form  does  satisfy  the  conditions  necessary  to 

P-2 

insure  the  full  period  of  2 

Another  variation  of  equation  (1-1)  was  proposed  by  Rotenberg 
whose  generator  became 


Lr+1 


(  2A  +  1  )  Yr  +  G 


(mod  2P) 


(1-3) 
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This  again  was  faster  than  Greenberger ' s  model  by  one  add  instruc- 
tion  but,  unlike  the  former,  this  gave  a  full  period  of  2  for  A  —  2 


and 


III 


C  an  odd  integer . 

Equations  (1-2)  and  (1-3)  will  be  investigated  in 
for  use  as  pseudo-random  number  generators  for  the 


Chapters  II  and 
CDC  l60ii  computer. 
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II.  EMPIRICAL  TEST  PROCEDURES 

1.  General 

Previous  studies  made  using  other  machines  5  6^  [7 

that  the  quality  of  the  generated  pseudo-random  numbers  using  equations 
(1-2)  and  (1-3)  can  be  very  sensitive  to  the  selection  of  A.  Aside  from 
a  guarantee  of  the  period  length  (See  Appendix  A),  there  is,  presently, 
little  mathematical  theory  to  aid  in  the  selection  of  parameters  that  will 
enhance  the  random  attributes  of  the  generated  samples.  The  task  is 
largely  an  empirical  one  wherein  an  A  and  a  starting  value,  YQ,  are  se¬ 
lected  and  the  subsequent  output  is  submitted  to  a  battery  of  tests. 

Such  empirical  analysis  can  only  give  general  indications  as  to  desirable 
parameters  because  it  is  only  practical  to  investigate  a  small  portion 
of  the  total  cycle  of  the  generator.  This  fact  can  be  appreciated  by 
observing  that  using  a  modulus  of  2^  in  (1-2)  the  fraction  of  the  full 

period  that  would  be  considered,  if  even,  say  10  elements  were  tested, 

-3 

is  approximately  10 

As  has  been  cited  in  Chapter  I,  the  tests  for  randomness  offered  by 
Kendall  and  Smith  were  designed  to  check  on  the  randomness  of  the  indi¬ 
vidual  digits  of  their  table  of  random  numbers.  The  generators  under 
discussion  here,  however,  produce  pseudo-random  numbers.  Consequently, 
those  tests  that  lend  themselves  to  investigating  certain  properties 
associated  with  random  numbers  rather  than  those  associated  with  indi¬ 
vidual  digits  are  favored  in  the  investigative  work  herein  reported. 

Should  a  requirement  arise  for  random  digits  when  using  equations  (1-2) 
and  (1-3),  only  the  high  order  (left-most)  bit  positions  of  the  binary 
representation  of  the  generated  number  diould  be  considered.  (See 
Appendix  A ) . 
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2.  Uniformly  Distributed  Random  Variables. 

Many  uses  of  computerized  sequences  of  random  numbers  require  that 
they  be  uniform  0,1  .  Not  only  is  this  distribution  used  in.  its  own 
right  but  numbers  from  this  distribution  may  be  used  to  produce  numbers 
with  other  distributions.  Converting  the  generated  pseudo-random  numbers 
of  equations  (1-2)  and  (1-3)  to  uniform  0,1  is  quickly  accomplished  in 
a  high  speed  digital  computer  as  it  merely  requires  the  positioning  of  a 
decimal  point  to  the  left  of  the  generated  numbers.  (See  Appendix  B.) 

3.  Description  of  Tests. 

The  following  tests  were  performed  on  the  blocks  of  numbers  forming 
a  generated  sequence. 

(a)  Frequency  test:  The  classic  chi-square  goodness  of  fit  test 
was  used  as  a  frequency  test  |V  •  This  test  divides  the  unit  interval 
into  k  equal  parts.  The  generated  numbers  were  each  checked  and  a  tally 
kept  on  their  frequency  distribution  within  the  k  intervals.  This  is, 
perhaps,  the  most  frequently  applied  of  the  random  tests. 

Attempts  have  been  made  to  establish  some  sort  of  an  optimal  size 
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,  present  a  procedure  by 


for  k.  Mann  and  Wald,  in  their  paper 
which  the  lengths  of  the  class  intervals  are  determined  so  that  the  prob¬ 
ability  of  each  class  under  the  null  hypothesis  is  equal  to  1/k  where  k 
is  the  number  of  class  intervals.  They  establish  a  Theorem  which  says 
that,  in  the  limit,  the  best  (defined  by  them)  value  of  k  is  given  by 


■2“  d 
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is  equal  to  the  size  of  the  critical  region  (probability  of  the  critical 
region  under  the  null  hypothesis)  and  H  is  the  number  of  random  elements 
generated.  For  significance  levels  of  ,01  and  .05  the  corresponding 
values  of  C  are  2.327  and  1.61*5  respectively.  These,  in  turn,  lead  to 
k  «=  88  for  ,01  and  k~  111*  for  .05  with  N  5000.  A  k  =  100  was  used  in 
the  large  sample  tests  reported  on  in  Chapter  III. 

(b)  Serial  Correlation:  This  test  was  set  up  following  the  formula 
given  by  Kendall  [$?]  •  Lags  from  1  through  10  were  examined. 

(c)  Run  test:  The  number  of  runs  up  and  down  were  examined  and  com¬ 
pared  with  the  expected  theoretical  distribution  using  a  chi-square  test. 
The  first  number  to  be  examined  is  compared  with  the  next  number  in  the 
sequence.  If  the  second  is  larger  than  the  first,  a  run  up  of  size  one 
has  occurred  but  is  not  yet  recorded.  The  second  and  third  numbers  are 
then  examined  in  the  same  way.  If  the  third  is  greater  than  the  second, 
a  run  of  size  two  has  occurred  but  is  not  yet  recorded.  If  the  third  is 
smaller  than  the  second,  a  run  of  size  one  is  recorded  etc,  10 

(d)  Moments:  The  first  four  central  moments  were  computed  and  com- 

r 

pared  with  those  for  the  uniform  distribution. 

(e)  Poker  Test:  This  test  was  used  to  examine  the  sequence  of 
digits  within  a  number.  Starting  at  the  highest  order  bit  position,  the 
digits  were  grouped  to  give  five  octal  digits  (15  bit  positions).  The 
poker  hand  value  of  these  five  digits  was  tallied.  The  next  15  bit 
positions  of  the  same  random  number  were  similarly  examined  and  tallied. 
The  succeeding  generated  numbers  were  examined  in  a  similar  fashion.  A 
chi-square  test  was  performed  on  the  results. 
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U.  Small  Sample  Tests. 


Exploratory  tests  were  made  on  generators  (1-2)  and  (1-3)  using  two 


different  starting  values,  Yg  =  1  and  Yg  =  2^  -  1.  Sample  sizes  of 


100  were  taken  at  each:  A  =  3>10,l3. , .  .,Ij3.  This  was  done  in  order  to 
ascertain,  in  a  general  way,  the  sensitivity  of  the  generators  to 
changes  in  A  .  The  results  presented  in  Tables  1  -  8  of  Chapter  III 
indicate  that  for  these  particular  starting  values  the  selection  of  A 

is  critical.  It  further  confirmed  that  a  good  selection  of  A  was  approx - 

.  i 
li7  2 

imately  (  2  )  for  both  generators.  This  supp 'rted  the  rule  of  thumb 


that  was  offered  by  Greenberger 


Selection  of  Yg  =  1  also  gives  an  indication  of  how  the  generator 
would  respond  when  faced  with  a  low  number.  Cases  have  been  reported 


j?!  where,  when  this  occurs,  an  unfortunate  selection  of  A  will  make 
the  generator  extremely  sluggish  and  can  produce  several  hundred  low 
numbers  before  yielding  a  more  chaotic  output.  This  succession  of  low 
numbers  gives,  in  effect,  a  sequence  all  below  0.3.  Such  a  character¬ 
istic,  obviously,  is  undesired. 

Parallel  procedures  were  followed  using  a  sample  size  of  1000  num¬ 
bers  at  each  A. 

3.  Large  Sample  Tests. 

Having  selected  a  value  of  A  from  the  results  of  the  small  sample 
tests,  five  blocks  of  10,000  numbers  were  generated  and  subjected  to  the 
battery  of  tests. 

\ 

6.  Modifications. 

Experiments  were  conducted  to  investigate  the  quality  of  pseudo¬ 
random  numbers  generated  by  addressing  the  output  of  one  generator  to  a 


second  one  having  a  different  value  of  A.  This  linking  process  for 
equation  (1-2)  and  (1-3)  can  be  described  as  follows: 


Yr 

+ 

C\J 

lil 

Yr-1 

(mod  2^iY) 

Qr 

=  (  2B  +  1 ) 

Yr+  1 

(mod  2^) 

(2-1) 

where  Q^r  = 

1,  2,  ..... 

the  sequence  of  interest. 
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III.  RESULTS  \NU  CONCLUSIONS 


1.  Interpretation  of  Data. 

No  attempt  was  made  to  select  the  best  parameter  A  based  on  the 
small  sample  tests  discussed  in  Chapter  II.  Rather,  the  observed  data 
was  prepared  in  tabular  form  for  each  of  the  generators.  This  allows 
general  comparisons  to  be  made  on  the  effects  of  changing  the  inputs  on 
each  of  the  generators  being  investigated.  In  this  way,  certain  ranges 
of  parameters  are  observed  to  produce  small  samples  whose  performance  is 
consistently  poor  under  the  battery  of  tests.  It  also  permits  a  compari¬ 
son  of  one  generator  with  the  other  as  A  is  varied.  For  if  one  assumes, 
as  will  be  done  in  this  paper,  that  the  generator  that  exhibits  the 
least  number  of  these  "poor"  indications  is  the  preferred  generator 
such  a  tabulation  will  make  it  apparent.  This  assumption  does  not  take 
into  account  the  differences  in  speed  of  generation  between  (1-2)  and 
(1-3).  In  the  CDC  l60li,  this  difference  is  one  add  instruction  or  7.2 
micro-seconds  pier  word  and  although  this  is  a  consideration,  for  pur¬ 
poses  of  this  analysis,  it  will  be  considered  minor.  The  same  can  be 
said  of  the  differences  in  cycle  length  between  the  generators  under 
consideration  because  each  of  them  is  extremely  large.  In  short,  the 
point  of  focus  in  the  discussion  to  follow  is  to  be  centered  on  the  gen¬ 
erators  ability  to  produce  pseudo-random  numbers  that  are  of  practical 
use . 

A  particular  parameter  was  considered  "poor"  if  it  failed  to  pass 
anyone  of  the  following  arbitrarily  selected  criteria: 

(a)  The  frequency  chi-square  statistic,  Xq  must  be 

3.3  —  Xc/^  —  16.9  that  is,  for  9  degrees  of  freedom  the 


Hi 


critical  values  of  the  chi-square  were  selected  as  .05  and  .95. 

(b)  No  long  runs;  where  a  long  run  will  be  considered  a  run  of 
length  six  or  greater  for  samples  of  size  100  and  of  length  seven 
or  greater  in  samples  of  size  1000. 

(c)  Serial  correlation  coefficients  of  lag  one  must  be  less  than  0.2 
for  samples  of  size  100  and  0.05  for  samples  of  size  1000. 

(d)  Mean  greater  than  or  equal  to  0.U30  and  less  than  or  equal  to 
0.570. 

2.  Results  of  Small  Sample  Tests. 

Tables  1  through  8  indicate  the  results  of  varying  A  in  the  gener¬ 
ators 


Y 


(mod  2^7) 


(3-D 


and 


(mod  2 ^7) 


(3-2) 


In  tables  1  through  Ij  =  1  while  in  tables  5  through  8  Y^  =  2^L  -  1  . 
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TABLE  1. 


TEST  RESULTS  OH: 

Yr+l  E(2A+ 3)Yr  (mod  2hl) 


TEST 

A 

THEO. 

5 

10 

12 

20 

29 

30 

39 

1*0 

1*9 

VALUE 

FREQUENCY 

1 

9.2 

1*.8 

10.8 

11.6 

13.6 

10.8 

7.8 

16.2 

17.1* 

~ </2 

X9:09  = 

4 

2 

19.9 

1*.6 

io.  U 

7.1 

U.l* 

8.9 

8.1* 

U.9 

9.9 

16.919 

o^jRXAL 

1 

.132 

.01*9 

-.179 

.219 

.313 

.037 

.11*3 

-.072 

-.190 

0.000 

CORRELATION 

2 

.oo9 

.030 

-.01*2 

.039 

.026 

.01*1* 

.013 

-.032 

.018 

MEAN 

1 

.1*79 

.1*69 

.1*98 

.1*99 

.1*29 

.1*79 

.1*67 

.917 

.1*79 

o.9oo 

2 

.1*88 

.900 

.908 

.1*88 

.'*93 

.1*98 

.1*90 

.900 

.902 

VARIANCE 

1 

.098 

.08!* 

.098 

.071* 

.090 

.101 

.090 

.093 

.069 

0.0833 

2 

.036 

.083 

.082 

.087 

.082 

.081* 

.082 

.081* 

.083 

RUNS  6 

1 

1 

0 

0 

0 

2 

1 

0 

0 

0 

0.0289 

RUNS  7 

2 

0 

0 

0 

0 

1 

1 

0 

0 

1 

0.0388 

1.  Sample  size  100 

2.  Sample  size  1000 


TABLE  2 


TEST  RESULTS  ON: 

Yr+i  =  (2A+ 3)Yr  (mod  2^7) 
Y0  =  1 


TEST 

A 

THS0. 

16 

17 

18 

19 

20 

21 

22 

23 

21* 

VALUE 

FREQUENCY 

1 

15.  1* 

11.1* 

8.0 

8.0 

11.6 

9.0 

16.8 

11.2 

15.1* 

2 

x9:05  = 

2 

12.6 

7.6 

5.1* 

8.3 

7.1 

7-7 

8.8 

16.3 

llt.O 

16.919 

SERIAL 

1 

.088 

.231 

.135 

.156 

.215 

.021* 

-.006 

-.002 

.189 

0.000 

CORRELATION 

2 

-.ol*o 

-.006 

.006 

.036 

.039 

-.005 

-.038 

.019 

.091 

KEAN 

1 

.1*80 

.1*75 

.>*99 

.1*85 

.1*59 

Lzr  : 

i 

.  in  i 

• 

.1*56 

.1*26 

.1*25 

o.5oo 

2 

.523 

.’*92 

.502 

.502 

.1*88 

.501* 

.1*95 

.513 

.1*71 

VARIANCE 

1 

.109 

CO 

On 

O 

6 

.091* 

.086 

.071* 

.083 

.093 

.072 

.0  90 

0.0833 

2 

.081* 

•086 

-3 

CO 

o 

• 

.082 

.087 

.086 

.081* 

.087 

.081* 

RUNS  -6 

1 

i 

1 

i 

0 

0 

0 

0 

0 

0 

1 

0.0285 

! 

RUNS  n  7  j2 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0.0388 

1.  Sample  size  100 
2'.  Sample  size  1000 


TABLE  3 


TEST  RESULTS  ON: 

Yr+  i  =  (2A+l)Yr  +1  (mod  2^7) 


TEST 

A 

THEO. 

VALUE 

5 

10 

15 

20 

25 

30 

35 

l*o 

1*5 

FREQUENCY 

2 

*9 

1 

27.0 

10.6 

23.8 

9.0 

-5o 

-5o 

>5o 

8.8 

>  5o 

Y2 

7 : 05  = 

2 

1*.6 

10.1 

7.2 

5.0 

-5o 

-5o 

8.1 

11.0 

>5o 

16.919 

SERIAL 

1 

.200 

.231* 

.150 

i 

i  •  ; 

O  ' 

UT 

VJT 

.999 

.999 

.91*3 

.029 

.1*26 

0.000 

CORRELATION 

2 

.081 

.030 

.002 

.01*6 

.999 

.982 

.387 

-.018 

-.1*28 

' 

MEAN 

; 

1 

1 

.1*10 

,.1*83 

.1*79 

.1*67 

.1*09 

-.013 

.319 

.1*71* 

.1*27 

0.500 

\2 

i 

.1*87 

.502 

.1*97 

.U97 

.398 

.395 

.1*81 

J*99 

.1*25 

i 

VARIANCE 

|i 

.090 

-.090 

.098 

.071 

.000 

.000 

.086 

.088 

.096 

0.0833 

|2 

.085 

.086 

.032 

.081 

.001 

.087 

.085 

.081 

.100 

RUNS  -6  Jl 

1 

1 

1 

1 

>15 

>15 

>15 

3 

0 

0.0285 

-  .  -  I 

RUNS  2: 7 

2 

1 

1 

0 

1 

>15 

>15 

>15 

>15 

0 

0.0388  | 

1.  Sample  size  100 

2.  Sample  size  1000 


TABLE  9 


TEST  RESULTS  ON; 

Yr+i  =  (2A+i)Yr  +1  (mod  2^7) 
To  -  1 


TEST 

A 

THSO. 

16 

17 

18 

19 

20 

21 

22 

23 

29 

VALUE 

FREQUENCY 

X  ^ 

9 

1 

61 

30 

16.  9 

9.e 

9.0 

9.9 

29.6 

.988 

900 

2 

Xo:o9--- 

2 

19.  9 

7.3 

8.8 

8.9 

9.1 

ii4.e 

26.9 

9919 

900 

16.919 

SERIAL 

1 

.769 

.308 

.189 

.209 

.099 

-.099 

-.099 

-.9o9 

.99 

n  on 

CORRELATION 

2 

.086 

.019 

-.013 

.021 

.ol46 

.027 

-.011 

-.398 

.99 

iUJ 

KEAN 

1 

.3148 

.389 

.1:17 

.967 

.967 

.968 

.390 

.399 

.209 

0.900 

2 

.989 

.999 

.901 

.991 

.997 

.li?l 

.919 

.199 

.199 

VARIANCE 

1 

.099 

.092 

.091 

.099 

.071 

.087 

•  071.4 

.091 

0.00 

0.0833 

2 

.086 

.08h 

.086 

.087 

.081 

.081 

.081 

.091 

0.00 

RUNS  -  6 

1 

li 

1 

1 

1 

1 

0 

0 

0 

>19 

0.0289 

RUNS  L 7 

2 

>  l5 

9 

0 

1 

1 

0 

0 

0 

>19 

0.0388 

1.  Sample  size  100 

2.  Sample  size  1000 


TABIE  5 


TEST  RESULTS  ON: 

Yr+i  =  (2A+  3)  Yr  (mod  2L?) 
Y0  =  2^6-l 


TEST 

A 

THEO. 

9 

10 

15 

20 

25 

30 

35 

bo 

U5 

'  VALUE 

FREQUENCY 

2 

1  17.2 

2.6 

6.1). 

19.0 

12.  h 

20.2 

5-8 

8.U 

11.I) 

4:05  = 

16.919 

X9 

2 

l-  — - 

l*.38 

U.15 

18.5 

17.8 

6.9 

10.3 

7.16 

8.5 

8.b 

SERIAL 

I 

1 

|  0.103 

-.350 

-.077 

.029 

-.01)6 

-.007 

-.0I4I 

.015 

.0U0 

0.00  ! 

! 

1 

CORRELATION 

2 

L.  : 

.032 

-.009 

-.062 

-.025 

.039 

-.010 

-.039 

.022 

-.002 

KEAN 

1 

. 

.U6  9 

r  ........ 

.1*80 

.513 

.527 

.537 

.520 

.511* 

.1)87 

.507 

i 

o.5oo  1 

1 

2 

.506 

.U99 

.1*95 

.5iU 

.1)93 

.520 

.1*97 

.1:88 

.509 

VARIANCE 

1 

.075 

.077 

.073 

.077 

.08U 

.075 

.075 

.085 

.076 

0.0833 

2 

.086 

.085 

.083 

.081 

.085 

.081 

.08U 

.088 

.089 

RUNS  -  6 

1 

i 

0 

1 

0 

0 

0 

0 

0 

0 

0.0285 

RUNS  H  7 

2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0.0388 

1 .  Sample  size  100 

2.  Sample  size  1000 


TABLE  6 


TEST  RESULTS  ON: 


Yr+1 

Yo  = 

=  (2A* 
21*6-! 

3)  Yr 

(mod  2hl 

) 

TEST 

A 

THEO. 

VALUE 

16 

17 

18 

19 

20 

21 

22 

23 

2h 

FREQUENCY 

X  9 

1 

10.  U 

19.  U 

19.2 

9.2 

19.0 

12.6 

7.2 

8.0 

5.U 

2 

x9:05  = 
16.919 

2 

5.o 

9.1 

6.7 

13.8 

17.8 

5.7 

10.6 

9.C 

19.1 

SERIAL 

1 

-.181 

.063 

-.102 

-.030 

.029 

.097 

.021 

-.118 

-.oiU 

0.00 

CORRELATION 

2 

.025 

-.016 

.022 

-.036 

-.025 

-.022 

.023 

-.033 

.0U6 

MEAN 

1 

.513 

.1:29 

.U38 

.511 

.527 

.hU3 

.5oU 

•5oo 

.U55 

0.500 

2 

.507 

.509 

.U91 

.511 

.5lU 

.U86 

.U87 

.507 

.U81 

VARIANCE 

1 

.073 

.068 

.081 

.071 

.077 

.073 

.072 

.096 

.081 

0.0833 

2- 

.083 

.086 

.085 

.081 

.081 

.083 

.082 

.086 

♦  086 

RUNS  -6 

1 

0 

0 

0 

0 

0 

0 

0 

1 

0 

.0285 

.  T- 

RUNS  —  7 

2 

0 

0 

0 

0 

0  * 

0 

0 

1 

0 

.0388 

1.  Sample  size  100 

2.  Sample  size  1000 


TABLE  7 


TEST  RESULTS  ON: 

Yr+1  =  (2A  +  1)  Yr  +1  (mod  2^7) 

Y0  =  2^6-l 


TEST 

A 

THSOR 

5 

10 

15 

20 

25 

30 

35 

Uo 

1:5 

VALUE 

FREQUENCY 

X2 

X  9 

1 

'  7.8 

11.0 

9.0 

8.0 

900 

900 

1:5.8 

5.2 

165 

-- 

*9:05' 

2 

6.8 

12.5 

9.6 

6.8 

uuuij 

103.9 

6.8 

3.5 

17  81: 

16.919 

SERIAL 

1 

-.092 

.0U2 

.073 

.058 

.999 

.999 

.912 

-.113 

-.350 

0.00 

CORRELATION 

.081 

.035 

-.021 

.023 

.999 

.971 

.3U9 

-.oUi 

-.1:21 

MEAN 

1 

.5io 

.1:61: 

.1:69 

.1:71 

.250 

.262 

.U35 

.1:91 

.1:18 

o.5oo 

2 

.503 

.U82 

.1:93 

.1:97 

.289 

.1:83 

.U97 

.5oU 

.1:21: 

VARIANCE 

i - 

.085 

.086 

.092 

.081 

.COO 

.000 

.056 

.076 

.098 

0.0833 

. 

2 

.081: 

.086 

.081: 

.085 

.001 

.070 

.081 

.080 

.100 

RUNS  -6 

1 

1 

1 

1 

1 

-15 

— 15 

-15 

2 

0 

.0285 

RUNS  17 

2 

2 

0 

1 

h 

-15 

>15 

-15 

-15 

0 

.0388 

1.  Sample  size  100 

2.  Sample  size  1000 


TABLE  8 


TEST  RESULTS  ON: 

Yr+1  =  (2A  +  1)  Yr  +1  (mod  2^7) 

Yc  =  2^6-l 


TEST 

A 

THEO. 

16 

17 

18 

19 

20 

21 

22 

23 

21* 

VALUE 

FREQUENCY 

2 

1 

U3.6 

19.8 

11.  u 

16.0 

8.0 

9.2 

31.6 

1*88 

900 

’/2 

9:05  - 
16.919 

i 

X9 

2 

8.0 

12.7 

3.7 

12.6 

6.8 

18.9 

272.1 

1*915 

7U77 

SERIAL 

1 

1 

.61*8 

I.  - . 

.11*7 

.066 

.003 

.058 

-.255 

-.Oii9 

-.1*01* 

.999 

0.000 

! 

CORRELATION 

! 

2 

-.008 

.oi*o 

.003 

.006 

.023 

.ool* 

.017 

-.398 

.999 

KEAN 

1 

.1*1*8 

.1*79 

.1*69 

.1*78 

.1*71 

.1*67 

.1*35 

.395 

.250 

i 

0.500  | 

i 

1 

1 

2 

l . 

.503 

.1*88 

.1*97 

.i486 

.1*97 

.1*71* 

.1*13 

.1*01* 

.270 

VARIANCE 

1 

.061*. 

l _ 

.073 

.073 

.075 

.081 

.082 

.088 

.051 

.000 

- . 1 

0.0833 

2 

L 

.079 

.085 

.085 

.081* 

.085 

.080 

.081 

.051 

.000 

RUNS  -  6 

1 

|1 

i _ 

U 

- - 

2 

2 

1 

1 

0 

0 

0 

>15 

0.0285 

! 

RUNS  >  7 

r 

2 

-15 

1* 

0 

7 

1* . 

0 

0 

0 

>15 

0.0388  1 

1«  Sample  size  100 
2.  Sample  size  1000 


In  applying  the  criterion  of  paragraph  (1),  the  number  of  values 
of  A  reflecting  a  "poor"  sample  in  the  case  of  (3-2)  is  greater  than 
for  (3-1).  Generator  (3-2)  is  very  susceptible  to  long  runs  and,  above 
an  A  of  25,  cannot  be  considered  a  practical  random  number  generator. 
Using  different  starting  values  did  not  improve  the  output  of  (3-2); 
for  Y')  =  1  only  an  A  *  21  passed  and  for  Yq  s  2^-  1  no  value  of  A 
passed.  Even  relaxing  the  criteria  so  that  two  failures  are  allowed 
before  an  A  is  considered  "poor"  only  admits  A  =  18  and  A  =  20  as  being 
acceptable  for  both  starting  values.  Further,  using  a  value  of 
G  =  (.788)  2^7  as  suggested  by  Coveyou  fioj  did  not  make  any  notice¬ 
able  improvement  in  (3-2). 

Table  9  shows  the  acceptable  A's  for  generator  (3-1). 


TABLE  9 


L 

1 

Acceptable  A  1 s 

j _  .  .  ...  _  _ 

1 

10 

\ 

15 

18 

19  21  22 

35 

bo 

2U6_i 

j 

j 

10 

j 

16 

19 

21  22  25 

35 

bo  b5 

The  above  test  results  indicate  that  for  these  criteria  generator  (3-1) 
is  much  less  erratic  than  (3-2).  Six  different  values  of  A, 

(  A  =  10, 19, 21, 22, 35, bO),  are  acceptable  using  both  starting  values  and 
therefore  it  is  suggested  that  A  be  chosen  from  these  values. 

Some  tests  were  also  made  using  the  approach  suggested  by  equation 
(2-1).  Aside  from  some  improvements  in  the  poker  tests  results,  no 
noticeable  improvement-s  were  made.  Using  this  approach  increases  the 
time  of  generation  per  word  and  unless  the  results  registered  a 
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significant  improvement,  it  could  not  be  economically  justified. 

3.  Results  of  Large  Sample  Tests. 

Using  the  findings  of  paragraph  (?)  above,  an  A  =  19  was  selected 
for  the  five  large  samples  to  be  tested.  This  particular  value  was 
selected  over  the  faster  A  *  10  because  it  appeared  to  perform  slightly 
better  than  A  =  10  in  the  poker  tests.  The  results  are  presented  in 
Tables  10  through  17  and  indicate  that  the  samples  satisfactorily  pass 
the  battery  of  tests.  Figure  1  .shows  the  average  serial  correlation  of 
the  five  samples  for  lags  one  to  ten.  Aver  these  lags  the  correlations 
ranged  from  +0,0182  to  -0.0201. 

h.  Remarks  on  Computer  Work. 

All  of  the  tests  used  in  this  paper  were  written  using  FORTRAN 
language.  These  have  been  catalogued  and  are  available  at  the  Computer 
Center  at  the  United  States  Naval  Postgraduate  School.  Programs  using 
an  A  =  19  in  Equation  (3-1)  have  been  written  both  for  Scrap  and  for 
Fortran  and  are  available  in  the  computer  library.  Appendix  B  lists 
the  machine  language  steps  required  to  produce  a  random  number  accord¬ 
ing  to  (3-1)  and  the  additional  steps  required  to  convert  it  to  a 
floating  point  fraction  between  zero  and  one.  These  steps  have  been 
included  to  allow  a  programmer  a  little  more  flexibility  in  using  the 
generator.  In  man}'  cases,  it  may  be  easier  to  include  these  steps 
directly  in  a  routine  rather  than  go  to  a  computer  library  subroutine. 

5 .  Conclusions , 

(1)  To  determine  a  suitable  value  of  A  in  (3-1)  the  methods  used 
in  small  sample  testing  are  useful.  Further  refinements  could  be 
made  by 
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FIGURE  1. 

A'/ERAGE  SERIAL  CORRELATIONS  FOR  TEN  LAGS 
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TABLE  10 


FREQUENCY  TEST 


1 

t 

//I 

n 

B 

#4 

B 

*9 

3.39 

11.66 

5.46 

13.36 

7.41 

PROBABILITY* 

INTERVAL 

(.90,. 95) 

(.20, .30) 

(.30,. 70) 

(.10, .20) 

(.50,. 70) 

X2 

99 

102.44 

91.98 

75.34 

119.52 

94.0 

PROBABILITY 

INTERVAL 

(.50, .25) 

(.50, .75) 

(.95,. 975) 

(.05, .10) 

(.50,. 75) 

"'(c,d)  implie 

r  2 

s  o  -  P  !  Xv 

>  x21^ 

•  Ao  j 

«=  d  where  XQ 

2 

is  the  observed 

value  of  the 
.90  «=  P  r  X 

chi-square 

2  -  3.39  - 

statistic 

.95- 

.  For  example 

,  (.90, .95) 

implies 

TABLE  11 
RUN  TEST 


RUN 

SIZE 

NUMBER  OBSERVED 

"  l 

i 

EXPECTED 

n 

n 

//3 

//  4 

B 

 J 

NUMBER 

1 

4105 

4210 

4166 

4228 

4io4 

4166.78 

2 

1806 

1780 

1822 

1329 

1851 

1833.09 

3 

545 

523 

558 

523 

506 

527.65 

4 

121 

127 

102 

113 

143 

115.04 

i  . . i 

5  * 

25 

26 

19 

16 

18 

20.33 

6 

6 

1 

2 

2 

2 

3.8 

l 
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TABLE  12 


RUN 

TEST  CHI-SQUARE  RESULTS 

n 

fj2 

B 

0h 

#5 

2  ' 

xu 

l*.22 

3.59 

3.60 

2.ue 

9.U6 

PROBABILITY 

INTERVAL 

(.30, .50) 

(.30,. 50) 

(.30,. 50) 

(.50,. 70) 

(.05, .10) 

TABLE  13 

POKER  TEST  (FIRST  FIVE  OCTAL  DIGITS) 


POKER 

HAND 

n 

#2 

#3 

#1* 

#5 

EXP. 

NUMBER 

BUST 

2001* 

2027 

2000 

2010 

1979 

2050.78 

ONE  PAIR 

5073 

5091 

5109 

5205 

5163 

5126.95. 

TWO  PAIR 

1623 

1597 

1578 

1U75 

1616 

1538.09 

THREE  OF  A  KIND 

1035 

1057 

1050 

10U6 

991 

1025.39 

FULL  HOUSE 

167 

11*5 

168 

165 

168 

170.90 

FOUR  OF  A  KIND 

96 

81 

92 

98 

83 

85.1*5 

FIVE  OF  A  KIND 

2 

2 

3 

1 

0 

2 .1*1* 
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TABLE  lh 

POKER  TEST  CHI-SQUARE  RESULTS  (FIRST  FIVE  OCTAL  DIGITS) 


• 

n 

#2 

#3 

#5 

2 

x5 

5.75 

7.68 

3.59 

6.61 

8.18 

PROBABILITY 

INTERVAL 

(.30, .5o) 

(.10, .20) 

(.50, .70) 

(.20, .30) 

(.10, .20) 

TABLE  15 

CHARACTERISTICS  OF  GENERATED  UNIFORM  DISTRIBUTION 
(FIVE  SAMPLES  OF  10,000) 


1 

2 

3 

h 

5 

EXP. 

VALUE 

MEAN 

.50103 

.5058U 

.50076 

.50358 

.50027 

0.50000 

VARIANCE  • 

.08339 

.08U813 

.08357 

.08306 

.081*77 

0.83333 

SKEWNESS 

.00029 

.00073 

.OOOlLi 

.00002 

.00001 

0.00000 

KURTOSIS 

.01255 

.01288 

.012U6 

.01238 

.012781 

0.01250 
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TABLE  16 


POKER  TEST  (SECOND  FIVE  OCTAL  DIGIT  POSITIONS) 


1 - 

POKER 

HAND 

n 

#2 

#3 

#1* 

#9 

EXP. 

NUMBER 

BUST 

20# 

20# 

2069 

1997 

2061 

2090.78 

ONE  PAIR 

#18 

#01* 

9196 

9H*8 

9128 

9126.99 

WO  PAIR 

#39 

i960 

1920 

1961 

1939 

1938.09 

THREE  OF  A  KIND 

1029 

1008 

1011 

1032 

999 

1029.39 

FULL  HOUSE 

168 

192 

166 

160 

167 

170.90 

FOUR  OF  A  KIND 

89 

79 

79 

101 

89 

89.1*9 

FIVE  OF  A  KIND 

2 

1* 

7 

1 

9 

2 .1*1* 

*Four  and  Five  of  a  Kind  combined  for  9  d.  of  f. 


TABLE  37 

POKER  TEST  CHI-SQUARE  RESULTS  (SECOND  FIVE  OCTAL  DIGIT  POSITIONS) 


i 

i 

i 

( 

//I 

#2 

#3 

#1* 

#9 

2 

X5 

0.18 

0.982 

1.32 

1* .  89 

1.86 

PROBABILITY 

INTERVAL 

^  .99 

(.98-. 991 

(.99-. 90) 

(.90-. 30) 

(90-. 80) 

30 


(a)  selecting  a  few  more  starting  values,  Yq,  and  tabulating 
the  results  as  in  Tables  1  through  li. 

(b)  modifying  the  criterion  used  in  the  sifting  process  to 
meet  an  individual's  unique  demands.  For  example,  if  he 
desired  an  unbiased  sequence  of  uniforms,  he  could  tighten 
the  restriction  on  the  acceptability  limits  of  the  mean. 

(2)  Based  on  the  test  results.  Equation  (3-1)  is  to  be  preferred 
to  (3-2)  as  a  generator  of  random  numbers.  The  speed  advantage  in  (3-2) 
of  about  7.2  micro-seconds  per  word  is  masked  by  its  demonstrated  ten¬ 
dency  toward  long  runs  and  poor  frequency  distributions  for  most 
values  of  A. 

(3)  Five  blocks  of  10,000  numbers  each  were  tested  using  gener¬ 
ator  (3-1)  with  A  =  19.  All  tests  were  passed  satisfactorily  and  it 
is  concluded  that 

Yr+1  =  (219  +  3)Yr  (mod  2hj)  (3-3) 

will  perform  as  a  pseudo-random  generator  for  the  CDC  loOiu 

(U)  The  chaining  of  generators  as  suggested  in  Equation  (2-1)  did 
not  lead  to  any  positive  indications  that  a  better  sequence  of  pseudo¬ 
random  numbers  would  result.  Unless  this  can  be  shown,  the  additional 
time  of  the  generation  is  not  warranted. 
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APPENDIX  A 


G.3NERAT  OR  CHARACT SCHIST  ICS 

1.  General. 

In  19149,  Lehmer  3  I  suggested  using  the  multiplicative  congraen- 
tial  method  for  generating  pseudo-random  numbers  on  a  digital  computer. 
Since  that  time,  this  method  has  been  used,  tested  and  reported  on  by  a 


number  of  investigations,  for  example  ,12 

1  J 

method  on  a  binary  computer  of  word  size  P,  the  recurrence  relation  most 
frequently  takes  the  form 


13 


In  using  this 


Vl  =  LYr 


(mod  2"  ) 


(A-l) 


p 

where  L  is  a  fixed  odd  integer  and  0-=Yr  —  2  -1  being  relatively  prime 
to  2P.  This  sequence  is  periodic,  its  length  depending  on  the  choice  of 
L,  Yr  and  P.  When  the  indicated  iteration  is  performed,  say  n  times, 
(1-1)  becomes: 

Yr+n  =  (L)n  Yr  (mod  2?) 

and  for  this  reason  is  sometimes  referred  to  as  the  power  residue  method. 

The  numbers  Yr/^P  (r =  0,1,2 ... .  )  become  the  desired  uniformly  dis- 

r  *| 

tributed  elements  from  the  unit  interval.  Barnett  lUj  in  1961  showed 

that  necessary  and  sufficient  conditions  to  obtain  a  maximum  cycle  of 
P-2 

2  elements,  all  of  which  are  distinct,  are: 


(a)  L  =  3  (mod  8)  or  I,  ~  c>  (mod  P)  (A-2) 

(b)  Yn=  1  (mod  2),  (A-3) 

that  is,  Y-,  must  be  odd. 

If  these  conditions  are  not  met  the  cycle  length  can  be  affected 
and  in  some  cases  the  process  degenerates  to  zero. 
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2.  Discussion  of  Equation  (1-2). 

While  the  sequence  produced  by  (1-1)  has  been  found  satisfactory, 
nevertheless,  it  does  imply  the  use  of  the  multiplication  instruction  in 
the  computer.  Relatively  speaking,  this  is  a  time-consuming  operation 
and  interest  continued  on  methods  that  would  avoid  its  use  and  still  pro¬ 
duce  the  desired  sequence.  Greenberger  ; h  proposed  using  an  L  of  the 

A 

form  (2  +3)  where  2  —  A  <  P.  It  is  seen  that  such  an  I,  satisfies 

P-2 

the  conditions  necessary  to  insure  a  cycle  of  2  .  This  form  has  the 

advantage  of  substituting  a  shift  operation  of  size  A  and  three  add 
instructions  for  the  multiplication  operation.  In  the  GDC  l60li,  the 
multiplication  instruction  15  requires  25.2  plus  .8n  micro-sounds; 
where  n  is  the  number  of  ones  in  the  multiplier.  On  the  other  hand,  the 
shift  instruction  takes  2.8  plus  .ks  micro-sounds,  where  s  is  the  number 
of  places  shifted  and  an  add  instruction  takes  7.2  micro-sounds.  In 
operations  where  large  quantities  of  random  numbers  are  required  this 
saving  can  make  a  significant  contribution  toward  reducing  machine  time. 

When  this  form  of  L  is  translated  to  the  GDC  loOlj,  equation  (1-1) 
becomes: 


Yr+i  =  (2A  +  3)  Yr 


(mod  2hl ) 


(A-2) 


where  with  2  ^  A  -=  h7  the  period  of  the  sequence  is  2^,  that  is, 

2^  distinct  numbers  are  produced  before  the  sequence  repeats.  Each  of 
the  numbers  produced  are  odd  and  belongs  to  one  of  two  mutually  exclu¬ 
sive  integer  sets  depending  on  the  selection  of  Y0  where,  if  r  belongs 

to  one  set  r+2  cannot.  These  two  sets  between  them  exhaust  all  of  the 

U7 

possible  odd  numbers  in  the  interval  1  to  2  .  Because  the  numbers  are 
odd  this  means  that  the  two  least  significant  bit  positions  in  the 
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binary  representation  of  a  number  will  not  change,  that  is,  their  cycle 
is  zero.  The  third  bit  position  has  period  2"*',  the  fourth  2^  and  so  on. 
For  this  reason,  if  random  digits  are  to  be  used  only  the  most  signifi¬ 
cant  digits  of  a  number  should  be  considered. 

No  mathematical  rules  have  been  developed  to  accurately  determine  a 
particular  choice  of  A  although  it  is  clear  that  an  unfortunate  selection 
here  can  result  in  an  unacceptable  generator.  Coveyou  ill  did  offer 
a  criterion  for  selecting  A  in  terms  of  an  approximation  formula  he 
derived  for  the  reduction  of  serial  correlation  between  the  numbers. 
Subsequently,  Greenberger  5>  introduced  a  correction  term  to  be 
applied  to  Coveyou' s  expression  and  demonstrated  that  even  though  serial 
correlation  might  be  optimized,  nonetheless,  the  generator  might  suffer 
from  other  shortcomings,  such  as  producing  long  sequences  of  low  numbers. 

3.  Discussion  of  Equation  (1-3) 

Rotenberg  1 6  tested  a  sequence 

Yr+1  5  L  Yr+  C  (mod  2P)  (A-l4) 

where  L  and  C  are  odd  integers  less  than  P  and  0  ^  Yr^  2P.  In  this 
case,  a  maximum  period  of  2P  is  obtained  for  any 

L  -  1  (mod  U)  (A-5) 

An  L  of  the  form  (2^  +  1)  with  A  —  2  satisfies  the  condition  of  (A-5)  and 
was  used  by  Rotenberg  in  his  tests.  He  examined  two  values  of  C, 

0=1  and  C  =  (.788)  2  ,  the  latter  value  resulted  from  Coveyou' s  approxi¬ 
mation  mentioned  above.  Rotenberg  reported  that  there  were  no  significant 
differences  in  his  results  using  either  one  of  these  C  values. 
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For  the  CDC  l60lt,  this  generator  becomes 

Yr+i  -  (2a  +  1)  Yr  +  C  (mod  2^)  (A-6) 

This  type  has  an  edge  on  form  (A-2)  in  that  it  requires  one  less  add 
instructor  for  each  new  number.  It  also  has  a  longer  period  although 
this  does  not  seem  too  significant  in  that  ( A-b )  will  produce 
distinct  elements. 

U.  Sub  Periods. 

It  has  been  pointed  out  by  Peach  16  that  in  the  case  of  (A-6) 
certain  harmonics  exist  throughout  its  period.  These  tend,  he  feels, 
to  lend  a  greater  stability  to  the  generated  numbers  than  should  be  ex¬ 
pected  of  a  random  sample.  He  illustrates  his  point  by  making  frequency 
tests  on  large  samples  and  notes  that  they  appear  to  be  unduly  uniform, 
although  his  results  do  not  overwhelmingly  indicate  this.  Whether  too 
much  stability  is  objectionable  depends  on  the  particular  application. 

If  a  user  wanted  to  estimate  an  unknown  mean  without  bias  then  this 
characteristic  might  not  be  critical.  If  the  variance  is  a  point  of 
interest  too  much  stability  could  be  critical. 

5.  An  Example. 

To  illustrate  the  multiplicative  congruentiel  method  consider 


equations  .(A-h)  which  with  L  =  Y  =  0,  C  =  h,  P  =  3  becomes 


5  «r  * 


(mod  2J) 

The  sequence  is  determined  as  follows. 
Substituting  Y  =  0  into  equation  (A-7)  gives 

Y-i  =  5  (mod  23) 


(A-7) 
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This  means  that  Yp  is  the  remainder  after  dividing  5  by  2  .  This  give 
Yp  =  5.  Substituting  this  back  into  (A-?)  gives: 

Y ?  =5  l 6}  (mod  2' ) 

Dividing  30  by  2J  gives  a  remainder  6.  Therefore  Y2  =  6.  This  is 
repeated  giving  a  sequence  Y]  =5,  Y2  =  0,  Y^  =  3,  Yj^  =  >4,  Y^  =  1 
Y5  =  2,  Yy  =  7.,  Yp  =  0.  of  period  2-\ 


APPENDIX  B 


PROGRAMING  THE  GENERATOR 


Method 

1.  The  Relation. 


Y. 


r  +  1 


(21?  ~  3)  Yr  (mod  2^7) 


Y  =  2 
io  c 


1*6  -  1 


can  be  programmed  for  the  GDC  l6oU  as  follows: 

(a)  Define:  MASK  =  J4OOOOOOOOOOOOOOO  Binary 
MASK2  =  2000000000000000  Binary 


MACHINE  LANGUAGE  ROUTINE: 
ENQ  (0) 

LDA  (X0) 

LLS  (19) 

SCL  (MASK) 

ADD  (XQ) 

ADD  (X0) 

ADD  (X0) 

3TA  (X0) 


(b)  At  this  point,  the  random  number  is  in  the  accumulator  and  has 
been  stored  in  Xn  in  preparation  for  the  next  number  if  required.  It 
is  now  required  to  convert  it  from  its  present  integer  form  to  a  float¬ 
ing  point  fraction  on  the  unit  interval.  This  is  done  by  continuing  on 
with  the  following  instructions: 
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Aft 3  (11) 

ADI)  (MASK2) 
FAD  (MASK2) 


At  this  time,  the  desired  uniformly  distributed  o,l  number  is  in 
the  A-  register.  The  time  of  generation  is  about  90  micro-seconds  per 
number . 


Uo 


